DETAILED ACTION
Response to Amendment 

1. Applicant's arguments with respect to claims 1-12 have been considered but are
moot in view of the new ground(s) of rejection in view of Allinger (DE 19747745) 
necessitated by claim amendment. 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claims 1, 3, 5-8, and 11 are rejected under 35 U.S.C. 103(a) as being
unpatentable over Junqua et al. (US 6415257) in view of Partovi et al. (US 6807574), 
and further in view of Allinger (DE 19747745). 

4. Regarding claims 1 and 7, Junqua et al. disclose a dialog system and a method 
of operating a dialog system (figure 3) comprising processing units for automatic speech 
recognition (12 of figure 1), natural language understanding (24 of figure 1), defining 
system outputs in dependence on information derived from user inputs (col. 2, ln. 28-31),
generating acoustic and/or visual system outputs (col. 10, ln. 65 to col. 11, ln. 3
and/or element 36 of figure 1), deriving user models (col. 2, ln. 36-42, the input speech
signal is processed and parameterized for use in the speech recognition process)
from determined details about the style of speech of user inputs and/or details about
interactions in dialogs between users and the dialog system, and adaptation of contents
and/or form of system outputs in dependence on the user models (col. 2, ln. 54 to col. 3,
ln. 26, or referring to figure 2, the user's profile includes a log that keeps track of the
user's viewing preferences and the user's speech patterns).

Junqua et al. fail to specifically disclose wherein the style of speech is
determined based on a factor selected from the group consisting of: the number of polite
phrases used, address used, speech level, information density, vocabulary and use of
foreign words, number of different words and classification of words of speech inputs
with respect to rare occurrence; and defining system outputs in dependence on
information derived from user inputs, which includes an experience level, wherein the
system output is based on the experience level of the user model in that if the
experience level is low, the system output is a first length, while if the experience level is
high, the system output is a second length lesser than the first length. However, Partovi
et al. teach wherein the style of speech is determined based on speech level (col. 12,
lines 36-55, "southern dialect" reads on speech level as defined in paragraph 18 of the
application) and classification of words of speech inputs with respect to rare occurrence
(col. 13, lines 36-52, "San Francisco" are rare occurrence words).

Because Junqua et al. and Partovi et al. are analogous art from the same field of
endeavor, it would have been obvious to one of ordinary skill in the
art at the time of invention to modify Junqua et al. by incorporating the teaching of
Partovi et al. in order to improve speech recognition accuracy.

The modified Junqua et al. fail to specifically disclose defining system outputs in 
dependence on information derived from user inputs, which includes an experience 
level, wherein the system output is based on the experience level of the user model in 
that if the experience level is low, the system output is a first length, while if the
experience level is high, the system output is a second length lesser than the first
length. However, Allinger further teaches defining system outputs in dependence on
information derived from user inputs, which includes an experience level, wherein the
system output is based on the experience level of the user model in that if the
experience level is low, the system output is a first length, while if the experience level is
high, the system output is a second length lesser than the first length (page 6, line 34 to
page 7, line 37).

Because the modified Junqua et al. and Allinger are analogous art from the same
field of endeavor, it would have been obvious to one of ordinary skill
in the art at the time of invention to further modify Junqua et al. by incorporating the
teaching of Allinger in order to enable the dialog to be shortened for users.
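
For illustration only, the length limitation recited in claims 1 and 7 may be sketched as follows. The sketch assumes a session count as the measure of experience; the names UserModel, EXPERIENCED_AFTER_SESSIONS, and select_prompt, as well as the prompt wording, are hypothetical and are not taken from the application or from any cited reference.

```python
from dataclasses import dataclass

# Hypothetical threshold: sessions completed before a user counts as experienced.
EXPERIENCED_AFTER_SESSIONS = 5

# Hypothetical system outputs of a first (longer) and a second (shorter) length.
LONG_PROMPT = ("Please say the name of the channel you would like to watch, "
               "for example 'channel five'.")
SHORT_PROMPT = "Which channel?"


@dataclass
class UserModel:
    """Illustrative user model; the session counter is an assumption."""
    completed_sessions: int = 0

    @property
    def experience_level(self) -> str:
        # Map the assumed usage counter onto a two-valued experience level.
        if self.completed_sessions >= EXPERIENCED_AFTER_SESSIONS:
            return "high"
        return "low"


def select_prompt(user: UserModel) -> str:
    """Return the longer output for low experience, the shorter one otherwise."""
    return LONG_PROMPT if user.experience_level == "low" else SHORT_PROMPT
```

Under these assumptions, select_prompt(UserModel(0)) yields the longer output and select_prompt(UserModel(10)) yields the shorter one.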

5. Regarding claim 8, Junqua et al. disclose a process for television-user dialog,
comprising the steps of: receiving user speech input (element 10 in figure 1); processing
the speech input using automatic speech recognition and natural language
understanding (elements 12 and 24 in figure 1); and defining at least one system output
based on the speech input and a user model derived from details of the user style of
speech inputs (col. 2, line 54 to col. 3, line 67, speech/speaker adaptation).

Junqua et al. fail to specifically disclose wherein the style of speech is
determined based on a factor selected from the group consisting of: the number of polite
phrases used, address used, speech level, information density, vocabulary and use of
foreign words, number of different words and classification of words of speech inputs
with respect to rare occurrence. However, Partovi et al. teach wherein the style of
speech is determined based on speech level (col. 12, lines 36-55, "southern dialect"
reads on speech level as defined in paragraph 18 of the application) and classification
of words of speech inputs with respect to rare occurrence (col. 13, lines 36-52, "San
Francisco" are rare occurrence words).

Because Junqua et al. and Partovi et al. are analogous art from the same field of
endeavor, it would have been obvious to one of ordinary skill in the
art at the time of invention to modify Junqua et al. by incorporating the teaching of
Partovi et al. in order to improve speech recognition accuracy.

The modified Junqua et al. fail to specifically disclose wherein the system output
is based on the experience level of the user model in that if the experience level is low,
the system output is a first length, while if the experience level is high, the system output
is a second length lesser than the first length. However, Allinger further teaches
wherein the system output is based on the experience level of the user model in that if
the experience level is low, the system output is a first length, while if the experience level
is high, the system output is a second length lesser than the first length (page 6, line 34
to page 7, line 37).

Because the modified Junqua et al. and Allinger are analogous art from the same
field of endeavor, it would have been obvious to one of ordinary skill
in the art at the time of invention to further modify Junqua et al. by incorporating the
teaching of Allinger in order to enable the dialog to be shortened for users.

6. Regarding claim 3, Junqua et al. further disclose a dialog system characterized in 
that the user models contain estimates for the reliability of recognition results derived 
from user inputs (col. 7, ln. 1-32, the score associated with each candidate represents
the reliability of each recognized candidate). 

7. Regarding claim 5, Junqua et al. further disclose a dialog system characterized in 
that fixed models of user stereotypes are used for forming the user models (col. 8,
ln. 8-26, a speaker adaptation process).

8. Regarding claim 6, Junqua et al. further disclose a dialog system characterized in 
that user models are used which are continuously updated based on inputs of the 
respective user (col. 3, ln. 1-27, the system includes a usage log recording the user's
everyday uses of the system).

9. Regarding claim 11, Junqua et al. further disclose the process of Claim 8,
wherein the step of defining comprises the step of: defining at least one system output 
based on the speech input and a user model which includes a familiarity level, wherein 
the system output is based on the familiarity level (col. 3, lines 1-25, familiarity level is 
determined by how often and/or how long the user has used the system and that is 
specified in the usage log). 

10. Claims 2, 4, and 10 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Junqua et al. (US 6415257) in view of Partovi et al. (US 6807574), further in view 
of Allinger (DE 19747745), as applied to claims 1 and 8, and further in view of Larsen 
(IEEE Publication). 

11. Regarding claim 2, Junqua et al. further disclose a dialog system characterized in
that in addition to the input modality to use user inputs by means of speech, at least a
further input modality is provided (col. 3, ln. 35-44). Junqua et al. do not disclose a
dialog system characterized in that the user models contain details about the respective 
use of the various input modalities by the user. 

However, Larsen teaches a bi-modal application used in a dialog system, where 
a DTMF input mode is used if repeated recognition errors occur in the speech 
recognition mode (referring to APPLICATION SECTION on pages 66-67). The 
advantage of using the teaching of Larsen in Junqua et al. is to enable the system to 
take appropriate actions to process the input signal to achieve high accuracy.

Because Junqua et al. and Larsen are analogous art from the same field of
endeavor, it would have been obvious to one of ordinary skill in the art at
the time of invention to modify Junqua et al. by incorporating the teaching of Larsen in
order to enable the system to take appropriate actions to process the input signal to
achieve high accuracy.

The modified Junqua et al. still fail to disclose a dialog system characterized in 
that the user models contain details about the respective use of the various input 
modalities by the user. However, it would have been obvious to one of ordinary skill in 
the art at the time of invention to readily realize that both DTMF and speech input 
modes, as taught by Larsen, are different and both are represented by two distinct 
signals. Therefore, the system would have distinguished and processed these two 
signals differently in order to enhance the system's efficiency and reliability. 

12. Regarding claim 4, Junqua et al. do not disclose a dialog system characterized in 
that in dependence on the estimates, system responses are generated which prompt 
the respective user to use such input modalities for which high estimate values were 
determined and/or which prevent the respective user from using input modalities for 
which low reliability values were determined. 

However, Larsen teaches a dialog system characterized in that in dependence 
on the estimates, system responses are generated which prompt the respective user to 
use such input modalities for which high estimate values were determined and/or which 
prevent the respective user from using input modalities for which low reliability values 
were determined (referring to APPLICATION SECTION on pages 66-67). The
advantage of using the teaching of Larsen in the modified Junqua et al. is to allow the 
system to switch to a different input mode to achieve high recognition accuracy. 

Because the modified Junqua et al. and Larsen are analogous art from the same
field of endeavor, it would have been obvious to one of ordinary skill in
the art at the time of invention to further modify Junqua et al. by incorporating the 
teaching of Larsen in order to allow the system to switch to a different input mode to 
achieve high recognition accuracy. 
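
For illustration only, the modality handling discussed for claims 2 and 4 may be sketched as follows. The sketch assumes a user model that counts recognition attempts per input modality and falls back to DTMF after repeated speech recognition errors; the names ModalityStats, UserModel, and MAX_CONSECUTIVE_SPEECH_ERRORS, the error threshold, and the neutral starting estimate of 0.5 are hypothetical and are not taken from Larsen or from the application.

```python
from dataclasses import dataclass, field
from typing import Dict

# Hypothetical threshold for "repeated recognition errors" in speech mode.
MAX_CONSECUTIVE_SPEECH_ERRORS = 2


@dataclass
class ModalityStats:
    """Per-modality usage details kept in the user model (assumed layout)."""
    attempts: int = 0
    successes: int = 0

    def reliability(self) -> float:
        # With no observations yet, assume a neutral estimate of 0.5.
        return self.successes / self.attempts if self.attempts else 0.5


def _default_stats() -> Dict[str, ModalityStats]:
    return {"speech": ModalityStats(), "dtmf": ModalityStats()}


@dataclass
class UserModel:
    stats: Dict[str, ModalityStats] = field(default_factory=_default_stats)
    consecutive_speech_errors: int = 0

    def record(self, modality: str, recognized: bool) -> None:
        """Log one input attempt and whether it was recognized."""
        entry = self.stats[modality]
        entry.attempts += 1
        entry.successes += int(recognized)
        if modality == "speech":
            if recognized:
                self.consecutive_speech_errors = 0
            else:
                self.consecutive_speech_errors += 1

    def suggested_modality(self) -> str:
        """Modality the next prompt should steer the user toward."""
        if self.consecutive_speech_errors >= MAX_CONSECUTIVE_SPEECH_ERRORS:
            return "dtmf"
        # Otherwise prefer the modality with the highest reliability estimate.
        return max(self.stats, key=lambda name: self.stats[name].reliability())
```

Under these assumptions, a new user is steered toward speech input, and two consecutive failed speech inputs change the suggestion to DTMF.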

13. Regarding claim 10, Junqua et al. further teach the process of Claim 8, wherein
the step of defining comprises the step of: defining at least one system output based on 
the speech input and a user model, wherein the system output is based on the likely 
input modality (col. 3, lines 1-67). Junqua et al. fail to specifically disclose a user model, 
which includes a likely input modality for a current prompt. However, Larsen teaches a 
user model, which includes a likely input modality for a current prompt (referring to 
APPLICATION SECTION on pages 66-67). 

Because Junqua et al. and Larsen are analogous art from the same field of
endeavor, it would have been obvious to one of ordinary skill in the art at
the time of invention to modify Junqua et al. by incorporating the teaching of Larsen in
order to enable the system to take appropriate actions to process the input signal to
achieve high accuracy.

14. Claim 12 is rejected under 35 U.S.C. 103(a) as being unpatentable over Junqua
et al. (US 6415257) in view of Partovi et al. (US 6807574), further in view of Allinger (DE 
19747745), as applied to claim 8, and further in view of Toyama et al. (US 6502082). 

15. Regarding claim 12, Junqua et al. fail to specifically disclose the process of
claim 8 further comprising the steps of: receiving a user face image; and determining a
degree of despair based on the user face image (col. 1, lines 38-54); wherein the step
of defining comprises the step of: defining at least one system output based on the
degree of despair (col. 1, lines 38-54). However, Toyama et al. teach the steps of:
receiving a user face image; and determining a degree of despair based on the user
face image (col. 1, lines 38-54); wherein the step of defining comprises the step of:
defining at least one system output based on the degree of despair (col. 1, lines 38-54).

Because Junqua et al. and Toyama et al. are analogous art from the same field of
endeavor, it would have been obvious to one of ordinary skill in the art
at the time of invention to modify Junqua et al. by incorporating the teaching of Toyama
et al. in order to enable the system to provide appropriate services for the user.

Conclusion 

Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 



Application/Control Number: 09/954,657 Page 11

Art Unit: 2626 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 
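
For illustration only, the date arithmetic described in the preceding paragraph may be sketched as follows. The sketch assumes calendar-month arithmetic clamped to the last day of a shorter month and ignores weekend and holiday rules and any extension of time; the function names add_months and shortened_period_expiry and the dates in the usage note are hypothetical.

```python
import calendar
from datetime import date
from typing import Optional


def add_months(start: date, months: int) -> date:
    """Same day of month `months` later, clamped to the end of a shorter month."""
    index = start.month - 1 + months
    year, month = start.year + index // 12, index % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def shortened_period_expiry(mailed: date,
                            first_reply: Optional[date] = None,
                            advisory_mailed: Optional[date] = None) -> date:
    """Expiry of the shortened statutory period under the stated assumptions."""
    three_months = add_months(mailed, 3)
    six_months = add_months(mailed, 6)
    expiry = three_months
    # A first reply within two months, followed by an advisory action mailed
    # after the three-month date, moves the expiry to the advisory mailing date.
    if (first_reply is not None and advisory_mailed is not None
            and first_reply <= add_months(mailed, 2)
            and advisory_mailed > three_months):
        expiry = advisory_mailed
    # The period for reply never runs past six months from the mailing date.
    return min(expiry, six_months)
```

With a hypothetical mailing date of date(2006, 6, 20), the function returns date(2006, 9, 20) when no reply is filed, and date(2006, 10, 2) when a first reply is filed on date(2006, 8, 15) and the advisory action is mailed on date(2006, 10, 2).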

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Huyen X. Vo whose telephone number is 571-272-7631.
The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil, can be reached on 571-272-7602. The fax phone
number for the organization where this application or proceeding is assigned is
571-273-8300.

Information regarding the status of an application may be obtained from the
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

HXV 6/14/2006 




